i>B*B#it/f(jp) 



© if m- tb m & 



(a) ¥3-229386 



©Int. CI. s 

G 06 K 9/03 
9/46 



C 6942- 5B 
B 6942-5B 



7J£35K1991)]0£UB 



@$$ IS ¥2-23940 
@iiJ & ¥2(1990)2^2 0 

®* si a Hffiatts^a m^a5a^s : m c t 3 W2Tg9#32^ 



91 « # 

LX 2 fiSft<i^$-a5^-r . <I <0 

* * # a t -t S * * 2 a SI 8 . 
3 . Jkwcoftm&stty 



tiX&J*f-^<?>Jc¥ieinigifre>A h cr - ? £ *& 
ffit. z (?) x h n - ? <o ft ff . Aba-; 

Xho-^Jfi£l5^rj£^--?i:LT. * ¥ <0 16 IB 
*&&+$Zttz£->X1$lft$tlt:1if&&%¥l^ 



(b ) i±mmz ft ft + h • 

PoHtk « & « l i ~> x m&nmm 

/Pv i , v 2 £ £ £ L t zixh<n^7 h /KO * 
<n&0 &e tfh &mm# t m ± r & ft , 

6 . 09 fcf# 5 0 (a) Cf>tJ:7C**l 
#<OXho-?P l P n t Ltiffi t 
<0#2*«0Xhn-7PlPn. P m P n JC # $ 
SftT&aiS*.** £ li 55 5 0 (b) £.7*? £ 
9 C * 3* 2 *<0 X ha — 7 P n P m . P m P I # 

fcl±^&*55:»&L*:*ffcBHi;U 0 ® ii C * * 

£ . w*)2«gft<3#£K 

0r85fi<HLTflli2&/t*-;'<?* h a - ^ £ & ai 



?.f 63 ¥3-229386 ( 2) 





ED *5 , B * 


P L ^ MlfiS/l^iiP ( L + M 


> 


cl 95 St 7T 


v 3 t t <n&m 






# & £3 fi& 
























**f ia ;if M 1 
vf « xL VA U 


, X h n — ^ v l . v 


2 . v 3 <0 & 


ill 


ST TT V • 




- 
























£ . S3 It ft 




■fee L\ f* 


cox b n — 9 ^^fS* 




1 


X tf> <> v* 




■ L . » * t 




3EJC »^ « *B cr 




l . ^ — a 




a 1+ 11 *- 








i- CJ A ? »». 






CO 


m /S'J 2: fT 7 


* 




< 


Iff % 












» as s& s r ti 


tn 






df> ffi ffi K -#* 




*§ £ S * 















^<ftai^cfe(tiJfr«fi«t:ffl^4A^>-^^ 
JMLTA ha-^lfiaitltt^i&f^^^K^ff-r 

xho-r^«a5SJc*5tt4^iiie(at:^ 

<£M0l> 

* 



-596- 



Sft^^E 2t:l8tt**i.*rM^*-;xttA ho 
-?#aiS$3fc:J:9. 3540 < a > * *: ii (b) 



?.fC3¥ 3-229386 (3) 

3£ffsOBlf££5ff2@^7Er-*Y - K £ HI V> T 31 

± # l *r # a <o /< ? > -*<oto»fi£ia£*-&. 

y 2Ctelrt£iiTv**;fc^'<*-:'0:fc*1iftiS& 

fit, #a*ttas&4li. ft^iitXha- 

& 5 *s & n ■** * . 

* - S: <0 $ £ li , S)»#6tit*93gL{El8t£-?- 

x y ; it iKttTt^titf , xhn-;jtjfiSi 

3C&£-r*#'<^*-*<Ofi£3E3eL. Xho 



*i . * ^ s sa o bs . sa^iso*^ttAhn- 
7 am ^^^-^$-35seLT^ao©fr 

4. @ifi«>ffi#&391 

a - y - h % 3&3@{ifc*0:fc*Sg5££-r7 
+ »40(a) , ( b > li * * * SB 

a*^*^xha-?£&a;-f*«££a93T* 



1 : X « £ M 0 . 2 ;iftn»J, 

3 ;Xhn-;7ifta;SK 4 : *l © S J* £15 3 . 

5 ; S8 EH 25 , 6 ; JS5 # 55 . 

itsA * ^ m m 



-597- 



?$ra ¥3-229386 (4) 



L-T—i 



55 



r H 



xKo-7 



> 



Out 



















i 


£2. 







END 



3 




7 * 7 ^ t U t 





A*7X- 7 






0-7 J*£ 


















7 yi t 



( END ) 



-598- 



T5H ¥3-229386 (5) 




-599- 



PTO 04-4773 Japanese Kokai Patent Application 

No. Hei 3[1991]-229386 



CHARACTER RECOGNITION DEVICE 



Naoki Sano 



UNITED STATES PATENT AND TRADEMARK OFFICE 
WASHINGTON, D.C. AUGUST 2004 

TRANSLATED BY THE RALPH MCELROY TRANSLATION COMPANY 



1 



JAPANESE PATENT OFFICE 
PATENT JOURNAL (A) 
KOKAI PATENT APPLICATION NO. HEI 3[1991]-229386 

Int. CI. 5 : G06K 9/03 

9/46 

Sequence No. for Office Use: 6942-5B 

Filing No.: Hei 2[1990]-23940 

Filing Date: February 2, 1990 

Publication Date: October 11, 1991 

No. of Claims: 1 (Total of 5 pages) 

Examination Request: Not filed 



CHARACTER RECOGNITION DEVICE 



[Moji ninshiki sochi] 



Inventor: Naoki Sano 

Applicant: Yokogawa Electric Corporation 

[There are no amendments to this patent.] 

Claim 

1 . In a character recognition device that has a photoelectric conversion part that reads 
characters written on a plan, converts them to binary values, and outputs binary image signals; an 
image memory that stores the binary image signals as an original pattern; a stroke extraction part 
that traces the contour of the aforementioned original pattern, that makes a polygonal line 
approximation of the contour point series detected, and that extracts the strokes in the 
aforementioned original pattern; and a differentiation part that compares the feature values of the 
aforementioned original pattern and a feature value dictionary for standard characters stored in 
advance to recognize the character graphics; a character recognition device characterized in that a 
control part is furnished to instruct that the parameters used for the polygonal line approximation 
at the aforementioned stroke detection [sic; extraction] part be changed and that the operations 
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after stroke extraction be executed again if results that cannot be differentiated are output from the 
aforementioned differentiation part. 

Detailed explanation of the invention 
Industrial application field 

This invention relates to a character recognition device that reads characters written on 
handwritten plans and the like to extract the character contour strokes from that pattern and 
recognize character according to distinctive features of the strokes. In particular, it is intended to 
reduce incorrect recognition and improve the recognition rate. 

Prior art 

Representative character recognition systems include those where strokes are extracted 
from the character contour lines of the character patterns that are read, and character recognition 
operations are performed by extracting geometrical features, such as the stroke position and 
length, and interrelationships between strokes. 

Figure 3 is a flow chart that represents each of the parameters described below. 

First, each of the parameters described below is set and a character stroke extraction 
operation is executed. 

As one stroke extraction method, a method is widely used where a contour point series, that 
is, character contour lines, detected by tracing the character contour is approximated with 
polygonal lines, and the character strokes are extracted by combining the polygonal lines. 

Methods of selecting the break points (feature points) of the polygonal line approximation 
include the two methods shown in Figures 4(a) and (b). Note that in these figures, the white circles 
are pixels and the black circles are noted pixels which constitute character contour lines (LI) and 
(L2) [sic; (L3)] of the original pattern, respectively. 

Figure 4(a) is a method that uses curvature and Figure 4(b) is a method that uses distance. 

The method that uses curvature in Figure 4(a) traces pixels on character contour line (LI) 
and successively selects large curvature 0. 

That is, two vectors (vl) and (v2) that connect the pixels (P(-N)) and (PN) that are N pixels 
along contour line (LI) with the pixel (PO) as the center are assumed for finding the curvature on 
character contour line (LI) by finding the difference (6) in the directions of the vectors. If value (0) 
is at or greater than a certain threshold value (0t), the pixel (PO) is used as a feature point. 

The method that uses distance in Figure 4(b) extracts feature points so that distance (d) 
between graphic (v3) that is approximated linearly and original graphic (L3) will be minimal. 



That is, the distance (d) between line segment (v3), that connects pixel (PL) and pixel 
P(L+M) located M pixels away, and each pixel is found, and the point where value (d) is at or 
greater than a certain threshold value (dt) is used as a feature point. 

In this way the feature points on a character contour line are found and approximated with 
polygonal lines, and strokes (vl), (v2) and (v3) are extracted. 

Then, returning to the flow chart in Figure 3, the feature values for a stroke, that is, position, 
length, relative position, and the like, are extracted, and the character recognition operation is 
started. In the recognition operation, the feature values of a stroke and a feature value dictionary 
for reference characters that has been set in advance are referenced. When there is agreement with 
the dictionary, that recognition result is output, and when there is no agreement, the output is a 
reject. 

In this way, characters corresponding to the character patterns are differentiated. 

Problems to be solved by the invention 

However, with conventional stroke extraction processing, the parameters for polygonal 
line approximation, that is, number of pixels (N) and value (0t) when curvature is used, and 
number of pixels (M) and value (dt), are fixed. So when a character pattern varies from a standard 
character pattern, for example, a stroke that should be extracted as 1 original stroke (PIPn) is 
separated into two strokes (PIPn) and (PmPn) and extracted as shown in Figure 5(a). And as 
shown in Figure 5(b), two original strokes (PnPm) and (PmPl) are sometimes extracted as one 
stroke (PnPl). So stroke extraction overall is unreliable and can lead to a drop in the character 
recognition rate. 

The issue for this invention is to solve the problems described above and its objective is to 
perform reliable stroke extraction and achieve an improved character recognition rate. 

Means for resolving the issues 

This invention, which resolves the issues above, is a character recognition device that is 
characterized in that, in a character recognition device that has a photoelectric conversion part that 
reads characters written on a plan, converts them to binary values, and outputs binary image 
signals; an image memory that stores the binary image signals as the original pattern; a stroke 
extraction part that traces the contours of the aforementioned original pattern, approximates an 
extracted contour point series, and extracts strokes in the aforementioned original pattern; a feature 
value extraction part that extracts the geometric features of the aforementioned strokes; a 
differentiation part that compares the feature values of the aforementioned original pattern and a 
feature value dictionary for standard characters stored in advance to identify the character 
graphics; a control part is furnished to instruct that the parameters used for the polygonal line 
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approximation at the aforementioned stroke detection part be changed and the operations after 
stroke extraction be executed again if results that cannot be differentiated are output from the 
aforementioned differentiation part. 

Operation of the invention 

This invented character recognition device changes all the parameters used for polygonal 
line approximation at the stroke detection part and again executes the operation after stroke 
extraction to extract strokes anew and again attempt character recognition. 

Application example 

Figure 1 is a block diagram of a character recognition device that implements this 
invention. 

Character signal (In) is read by photoelectric conversion part (1) and converted to a binary 
value. The binary image signal is stored in image memory (2) as the original pattern for the 
character that was read. 

The original pattern stored in image memory (2) undergoes stroke extraction processing as 
shown in Figure 4(a) or (b) by stroke extraction part (3) and the result is transmitted to feature 
value extraction part (4). 

Feature value extraction part (4) extracts feature values, such as length, position or the like 
for the transmitted stroke and sends them to differentiation part (5). 

A feature value dictionary where feature values for standard characters are stored has been 
established in advance for differentiation part (5). The stroke feature values sent from feature value 
extraction part (4) and the feature values for standard characters in the dictionary are compared to 
perform character recognition. 

Control part (6) primarily has a function for outputting identification results (Out) and 
instructs stroke extraction part (3) to change parameters (SP) if identification results are output 
from differentiation part (5) that cannot be identified. Here parameters refers to the items used 
when the strokes in the original pattern are extracted by stroke extraction part (3) as described 
above. They are number of pixels (N) and value (0t) when curvature is used as the stroke 
extraction method, and number of pixels (M) and value (dt) when distance is used. 

Now the operation of this invented character recognition device that is constituted in this 
way will be explained using the flow chart in Figure 2. 

First, control part (6) sets initial values for each of the abovementioned parameters for 
stroke extraction part (3). 
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According to that, stroke extraction part (2) [sic; (3)] traces the character contour lines of 
the character pattern stored in image memory (2) to extract its strokes. The stroke extraction 
method may be a method that uses curvature or it may be a method that uses distance. 

Then feature value extraction part (4) extracts feature values from the length, position, etc. 
for the strokes that are obtained and transmits them to differentiation part (5). 

Differentiation part (5) compares the transmitted feature values with the contents of the 
internal dictionary. If they agree, control part (6) outputs the identification results (Out). 

If they do not agree, control part (6) checks the number of repetitions, and if the number is 
at or less than the value (K), it changes all parameter values set for stroke extraction part (3) and 
repeats the same operation again from stroke extraction processing. 

If recognition is not possible even after the parameters are changed and the operation for 
recognizing the character is repeated K times, control part (6) outputs [the result] as a reject. 

All parameters are changed to perform stroke extraction in this way. 

Effect of the invention 

As stated above, with this invented character recognition device, for character recognition, 
the parameters for stroke extraction are changed and the same operations are repeated a number of 
times if recognition is not possible. So reliable character recognition can be realized no matter how 
much the original character pattern varies. 

Brief description of the figures 

Figure 1 is a block diagram of a character recognition device that implements this 
invention. Figure 2 is a flow chart that represents the operation by this invented device. Figure 3 is 
a flow chart that represents conventional character recognition. Figures 4(a) and (b) are figures for 
explaining stroke extraction from character contour lines. Figures 5(a) and (b) are figures for 
explaining the problem with conventional character recognition systems. 

(1) : Photoelectric conversion part, 

(2) : Image memory, 

(3) : Stroke extraction part, 

(4) : Feature value extraction part, 

(5) : Differentiation part, 

(6) : Control part. 
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